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CUGBP1 and MBNL1 are developmentally regulated RNA-binding proteins that are causally associated 
with myotonic dystrophy type 1. We globally determined the in vivo RNA-binding sites of CUGBP1 and 
MBNL1. Interestingly, CUGBP1 and MBNL1 are both preferentially bound to 3' UTRs. Analysis of 
CUGBP1- and MBNLl-bound 3' UTRs demonstrated that both factors mediate accelerated mRNA decay 
and temporal profiles of expression arrays supported this. Role of CUGBP1 on accelerated mRNA decay has 
been previously reported, but the similar function of MBNL1 has not been reported to date. It is well 
established that CUGBP1 and MBNL1 regulate alternative splicing. Screening by exon array and validation 
by RT-PCR revealed position dependence of CUGBP1- and MBNL1 -binding sites on the resulting 
alternative splicing pattern. This study suggests that regulation of CUGBP1 and MBNL1 is essential for 
accurate control of destabilization of a broad spectrum of mRNAs as well as of alternative splicing events. 



I n recent years, new technologies, such as microarray analysis and high throughput sequencing, have dramat- 
ically changed our knowledge on gene expression and revealed that extensive regulation takes place during 
posttranscriptional RNA processing 1 . Numerous RNA processing elements and regulatory RNA-binding 
proteins play together in a finely tuned interplay to ensure that different mRNAs are made from the primary 
transcript from a gene and are present in the right cell at the right time and in the correct amounts. Such complex 
regulation is of course vulnerable and a rapidly increasing number of human diseases are now known to be caused 
by misregulated RNA processing 2 . An intriguing example where this kind of disease mechanism is in operation is 
myotonic dystrophy type 1 (DM1), where aberrant regulation of two RNA-binding proteins, CUG-binding protein 
1 (CUGBP1) and muscleblind-like 1 (MBNL1) co-operationally cause some of the disease symptoms. DM1 is the 
most common form of myotonic dystrophy (DM), and is caused by an expansion of CTG-repeats in the 3' 
untranslated region (UTR) of the DM protein kinase gene (DMPK) on chromosome 19 35 . DM1 is a multisystemic 
disorder and the clinical features include myotonia, muscle degeneration, heart failure, ocular cataracts, impaired 
glucose tolerance, and mental retardation 6,7 . A dominant negative effect of the DMPK mutant allele through RNA 
gain-of-function has been proposed as the molecular disease mechanism. Many studies support a mechanism 
where toxic DMPK RNA with expanded CUG repeats binds to and sequesters proteins that are important for RNA 
metabolism including transcription, RNA transport, alternative splicing, translation, and yet unknown processes 6 . 
The expanded CUG repeats in the DMPK mRNA bind to and sequestrate MBNL1 in discrete nuclear foci, which 
results in depletion of functional MBNLl 8,9 . By a yet unknown mechanism the expanded CUG repeats also activate 
protein kinase C (PKC), which phosphorylates and stabilizes CUGBP1 1 ". Thus, the expanded CUG repeats 
contribute to DM1 pathogenesis by causing loss of MBNLl and gain of CUGBP1 activity 11 . 

Both CUGBP1 and MBNLl regulate postnatal transitions in alternative splicing patterns during striated 
muscle development" 1213 . Representative targets of CUGBP1 splicing regulation, which are misregulated in 
DM1 striated muscles, include genes for cardiac troponin T (TNNT2) 14,15 , insulin receptor (INSR) 16 , and chloride 
channel 1 (CLCN1) 15 ' 17 . MBNLl contains four CCCH-type zinc fingers that recognize a YGCY motif that is indeed 
observed in the CUG repeat (CUGCUG) 1821 . Mice deficient in Mbnll show aberrant splicing of Clcnl, Tnnt2, and 
Tnnt3, but not Inst 22 . Very recently, MBNLl was shown to regulate BIN1 alternative splicing, and dysregulation of 
BIN1 splicing in DM1 muscles was suggested to be part of the disease pathology resulting in muscle weakness 23 . 
Besides an important role in splicing regulation, CUGBP1 mediates mRNA decay of short-lived transcripts 
by interaction with GU-rich elements in the 3' UTR 24 27 . In addition, CUGBP1 increases the translation of 
CDKN1A 2 " and Me/2a 29 . In contrast to the multiple functionalities in posttranscriptional gene regulation of 
CUGBP1, MBNLl has so far been exclusively recognized as a splicing regulatory trans-iactor. 
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High-throughput sequencing of RNA isolated by crosslinking 
immunoprecipitation (HITS-CLIP) 30 is a new method that enables 
global mapping of targets for specific RNA-binding proteins in living 
cells, thereby shedding light on their role in regulation of RNA pro- 
cessing of known and unknown targets. 

In the present study, we performed HITS-CLIP analysis for 
CUGBP1 and MBNL1 on the mouse myoblast cell line C2C12 
to extensively characterize their RNA-binding sites and functional 
roles in RNA processing. We identified position-dependence of 
CUGBP1/MBNL1 -binding sites in regulating exon inclusion or 
skipping. Interestingly, we discovered that both CUGBP1 and 
MBNL1 preferentially bind to the 3' UTR and destabilize target 
mRNAs. This points to a new important role of MBNL1 and suggests 
that binding to the 3 ' UTRs and destabilization of mRNA are likely to 
be a fundamental function shared by CUGBP1 and MBNL1. 

Results 

Genome-wide CUGBP1/MBNL1-RNA interaction maps. In order 
to determine global CUGBP1/MBNL1 -binding sites in vivo, we 
performed HITS-CLIP experiments using the mouse myoblast cell 
line, C2C12. 

In C2C12 cells, CUGBP1 is constantly expressed throughout myo- 
blast differentiation, whereas expression of MBNL1 is low in undif- 
ferentiated cells and gradually increases during differentiation 
(Supplementary Fig. SI), as previously described 8 . We thus per- 
formed HITS-CLIP analysis of CUGBP1 and MBNL1 using undif- 
ferentiated and differentiated C2C12 cells, respectively. We also 
performed CLIP of MBNL1 using undifferentiated cells in three 
independent experiments, but this yielded an insufficient amount 
of RNA-protein complexes and failed to yield cDNA libraries suit- 
able for high-throughput sequencing. In the HITS-CLIP analysis of 
CUGBP1, our first experiment yielded 34,733,815 CLIP tags of 32 nt, 
of which 29,545,067 (85.06%) were mapped to the mm9 genome 
allowing at most 2 mismatches and placing reads mapping to mul- 
tiple locations to a single random site. A second CLIP experiment 
yielded 10,079,185 CLIP tags of 36 nt, of which 8,516,256 (84.49%) 
were mapped. In the first MBNL1 CLIP experiment, we obtained 
13,218,685 CLIP tags, of which 11,044,152 (83.55%) were mapped, 
while the second CLIP experiment yielded 1 3,474,600 CLIP tags with 
11,455,886 (85.02%) tags mapped to the mm9 genome. For the ana- 
lysis of binding motif and binding region annotation, we selected 
only reads that were aligned uniquely in the genome and removed all 
potential PCR duplicates by collapsing reads with an identical 5' start 
into a single read. This resulted in 177,013 and 130,828 CLIP tags 
from the two CUGBP1 CLIP experiments, while the two MBNL1 
experiments yielded 59,156 and 583,841 CLIP tags respectively. 

In an effort to confirm the specificity of our CLIP experiments, we 
performed CLIP analysis of polypyrimidine tract-binding protein 
(PTB), a multifunctional RNA-binding protein, using undifferenti- 
ated mouse C2C12 cells. We identified 12,841,778 CLIP tags of which 
11,184,829 (87.10%) were mapped to the mouse mm9 genome. 
Removal of non-uniquely aligned reads and PCR duplicates yielded 
307,995 unambiguous CLIP tags. 

Consensus motifs. To determine RNA-binding motifs associated 
with CUGBP1/MBNL1 in vivo, we used the motif-finding algori- 
thm, Multiple EM for Motif Elicitation (MEME) 31 . We used 
SeqMonk to identify likely binding regions, and identified 1,841 
CUGBP1 -binding regions and 302 MBNL1 -binding regions. Com- 
parison of SeqMonk's maximum depth scores between samples 
indicates that binding regions in each replicated experiment are 
highly overlapping, while PTB binding regions did not overlap 
with those of the other four CLIP experiments (Supplementary 
Fig. S2). The lower number of identified MBNL1 regions supported 
by two independent experiments (Supplementary Fig. S2b) was likely 
due to the large difference in the number of CLIP tags in the two 



MBNL1 experiments. The regions demonstrate enrichment of GU- 
rich motifs for CUGBP1 and YGCY-containing motifs for MBNL1 
(Fig. 1). 

Our in vivo binding motifs are in accordance with previously 
suggested binding motifs for CUGBP1 25 ' 32 and MBNL1 21 ' 33 35 . We 
identified 1,824 PTB binding regions in the mouse genome and 
detected a CU-rich motif, which is essentially identical to the motif 
for PTB recently identified by HITS-CLIP analysis of a human cell 
line 36 . 

We also analyzed the CUGBP1 and MBNL1 motifs enriched in 
regions containing reads with multiple potential mapping locations 
(Supplementary Fig. S3), and compared them with the motifs with 
unique mapping (Fig. 1). Following removal of potential PCR dupli- 
cates, we observed 699,382 tags that were non-uniquely aligned in the 
1st CUGBP1 CLIP experiment, 219,128 tags in the 2nd CUGBP1 
CLIP experiment, 105,432 and 216,882 tags in the two MBNL1 CLIP 
experiments respectively and finally 851,324 tags in the PTB CLIP 
experiment. We observed that enriched motifs in these regions (Sup- 
plementary Fig. S3) are very similar to the CUGBP1 and MBNL1 
motifs enriched in the binding regions containing uniquely aligned 
reads (Fig. 1), suggesting that these regions share the same properties 
as the uniquely aligned regions and that they may contain functional 
binding sites. 

HITS-CLIP analysis of splicing targets. We next studied the effects 
of CUGBP1/MBNL1 binding on alternative splicing. CUGBP1 tags 
are clustered in intronic regions flanking alternative rather than 



e-value 
1 .5e-256 
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e-value 
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Figure 1 | Binding motif analysis. WebLogos of consensus binding motifs 
of CUGBP1, MBNL1, and PTB generated by the MEME motif analysis 
tool. The likelihood of finding the indicated motif by chance is indicated as 
an E-value. 
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constitutive exons (Fig. 2a). MBNL1 tags are similarly clustered in 
intronic regions flanking alternative exons, and are also enriched in 
alternative and constitutive exons. In order to investigate if and how 
CUGBP1/MBNL1 binding around splice sites regulate alternative 
splicing, we knocked down these factors by siRNA in undif- 
ferentiated C2C12 cells (Supplementary Fig. S4a). We analyzed 
alterations of splicing globally using the Affymetrix Mouse Exon 
1.0 ST Array (GEO accession number, GSE29990) and identified 8 
CUGBP1 -responsive and 24 MBNLl-responsive exons (Supplemen- 
tary Table 1, Figs. S5 and S6abc). We also analyzed 29 CUGBP1- 
tagged and 51 MBNL1 -tagged exons/introns known to be alternatively 
spliced according to the ENSEMBL version e/61, and identified 16 
CUGBPl-responsive and 21 MBNLl-responsive exons by RT-PCR 
(Supplementary Figs. S5 and S6abc). We made the compiled dataset 



C, which is comprised of the 24 CUGBPl-regulated exons (15 
skipped and 9 included), as well as the compiled dataset M con- 
sisting of the 45 MBNL1 -regulated exons (25 skipped and 20 
included). The datasets include 1 and 9 previously identified target 
exons of CUGBP1 and MBNL1, respectively (Supplementary 
Fig. S5). In addition, 9 exons are shared between datasets C and 
M. Mbnll siRNA sufficiently suppressed MBNL1 expression up to 
day 3 after differentiation (Supplementary Fig. S4b), and we observed 
that as many as 44 of the 45 MBNL1 -regulated exons in dataset M 
respond similarly to MBNL1 knockdown in both differentiated and 
undifferentiated cells (Supplementary Figs. S4 and S5). 

We also made dataset M2 that includes 26 additional MBNL1- 
dependent cassette exons (15 skipped and 11 included) that were 
previously identified in skeletal muscle of MBNL1 knockout mice 
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Figure 2 | Mapping of CLIP-tags on exon-intron structures, (a) Distributions of CLIP-tags on constitutively or alternatively spliced exons and the 
flanking intronic regions. The abscissa indicates an intron-exon-intron structure. The sizes of all the exons are normalized to 1 50 nucleotides. Numbers of 
exonic CLIP-tags are also normalized accordingly. Intronic CLIP-tags within 500 nucleotides upstream or downstream of exons are indicated. The 
number of CLIP-tags is normalized for the number of transcripts belonging to either category of constitutive and alternative exons. (b) Normalized 
complexity map of CUGBP1 at CUGBP1 -dependent splice sites. Twenty- four CUGBPl-regulated splicing events in dataset C in undifferentiated C2C12 
cells are compiled, (c) Normalized complexity map of MBNL1 at MBNLl-dependent splice sites. Forty-four MBNL1 -regulated splicing events in 
differentiated C2C12 cells in dataset M are compiled, (d) Normalized complexity map of CUGBP1 at MBNLl-dependent splice sites. Sixty MBNL1- 
regulated splicing events in undifferentiated C2C12 cells in datasets M and M2 are compiled, (e) Normalized complexity map of MBNL1 at CUGBP1- 
dependent splice sites. Twenty-four CUGBPl-regulated splicing events in undifferentiated C2C12 cells in dataset C are compiled. Shaded areas represent 
an average of 100 sets of normalized complexity of 50 (b, c, and d) and 15 (e) randomly selected constitutive exons. Arrows indicate representative peaks 
that are explained in Results. Graphs represents results of the 2nd CLIP experiments for both CUGBP 1 and MBNL 1 . Resutls of the 1 st CLIP experiments 
are shown in Supplmentary Fig. S7. 
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(Supplementary Table S2) 33 , and found that 18 exons are similarly 
regulated by Mbnll knockdown in undifferentiated C2C12 cells 
(Supplementary Fig. S6d and Table S2). 

We combined datasets C and M into a single composite pre- 
mRNA and made integrated RNA maps from our HITS-CLIP reads 
mapped to the corresponding genomic regions as previously 
described for Nova 30 and PTB 36 . This showed that CUGBP1 binding 
to upstream intronic regions facilitates exon skipping, whereas 
CUGBP1 binding to downstream intronic regions promotes exon 
inclusion (closed arrows in Fig. 2b and Supplementary Fig. S7a). 
Results of the 2nd experiments are shown in Fig. 2 and those of 
the 1st experiments are in Supplementary Fig. S7. In contrast, 
although the binding sites of MBNL1 are more diffusely distributed 
and less abundant in regions flanking splice sites (Fig. 2c), MBNL1 
binding close to the 3' end of the downstream intron induces exon 
skipping (closed arrow in Fig. 2c and Supplementary Fig. S7b). The 
presence of a similar peak in dataset M2 (closed arrow in Supple- 
mentary Fig. S7c) further supports this observation. 

We next analyzed the interaction between CUGBP1 and MBNL1 
in splicing regulation. We made an RNA map of CUGBP1 -binding 



sites in MBNL1 -regulated exons from datasets M and M2 (Fig. 2d 
and Supplementary Fig. S7e), as well as an RNA map of MBNL1- 
binding sites in CUGBP1 -regulated exons from dataset C (Fig. 2e and 
Supplementary Fig. S7f). Both RNA maps demonstrate the presence 
of CUGBP1 clusters in MBNL1 -responsive exons and vice versa, 
which suggests that CUGBP1 and MBNL1 are likely to regulate 
alternative splicing of some of the same exons. 

MBNL1 and CUGBP1 both preferentially bind to the 3' UTR. 

MBNL1 has so far solely been categorized as an exon/intron- 
binding splicing regulatory protein 6 , but to our surprise we found 
that the majority (55%) of MBNL1 -binding regions are located 
in 3' UTRs (Fig. 3a). The same pattern with preferential binding 
(53%) in 3' UTRs is observed for CUGBP1, while only 2% of PTB 
binding regions are located in 3' UTRs (Fig. 3a). Similarly, when 
HITS-CLIP tags are mapped to the size- normalized positions of all 
the genes in the mouse genome, CUGBP1 and MBNL1 CLIP tags, but 
not PTB CLIP-tags, are enriched close to the 3' ends of genes (Fig. 3b). 
Additionally, 610 3' UTRs, which constitutes 28.7% of the CUGBP1- 
tagged 3' UTRs and 17.4% of the MBNLl-tagged 3' UTRs, are shared 
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Figure 3 | Enrichment of CUGBP1 and MBNL1 CLIP-tags in the 3' UTR. (a) Distributions of CUGBP1, MBNL1, and PTB binding regions. Binding 
regions are mapped to CDS (coding sequence), 5' and 3' UTRs, introns, intergenic regions (incl. tRNA and rRNA genes), or antisense within genes 
according to the UCSC knownGene annotation of the NCBI Build 37.1 mouse genome (mm9). Pie-charts show ratios of binding regions mapped to the 
indicated regions, (b) Distributions of CUGBP1, MBNL1, and PTB CLIP-tags mapped to the relative positions of all the mouse genes. The relative 
positions of the genes are shown in percentages of the gene length in abscissa. The broken lines represent 15,638 genes with constitutive transcriptional 
start and end sites (Cons), and the solid lines represent 7,477 genes with alternative transcriptional start or end site (Alt) . (c) Venn diagram of the numbers 
of genes with CUGBP1-, MBNL1-, and PTB-binding regions within the 3' UTR. Binding regions were identified using the SeqMonk software. 
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between these two proteins (Fig. 3c). All these data document that 
both CUGBP1 and MBNL1 preferentially bind to 3' UTRs, indicating 
that this is a key function of both proteins in RNA processing. This 
suggests that the functional repertoire of MBNL1 should be expanded 
and that MBNL1, from being primarily regarded as regulator of 
alternative splicing, should also be considered as an important 
regulator of 3' UTR- mediated processes, such as mRNA stability/ 
degradation. 

MBNL1 destabilize mRNAs. To analyze the function of CUGBP1/ 
MBNL1 binding to 3' UTRs, we made luciferase reporter constructs 
harboring CUGBPl/MBNLl-binding sites in the 3' UTR. Since no 
CLIP tags were observed in the 3' UTR of Gapdh (Supplementary 
Fig. S8), we made a luciferase-Gapdh 3' UTR expression vector, and 
then inserted 12 repeats of GT and 7 repeats of CTG immediately 
after the stop codon of luciferase to introduce a CUGBP 1 -binding site 
(GU rich motif) and an MBNL1 -binding site (YGCY motif), 
respectively (Fig. 4a). We also inserted 12 AC repeats as a control. 
Due to the high expression level of CUGBP 1 in C2C12 cells we used 
HEK293 cells for transient transfection of these reporter constructs 
along with CUGBP1/MBNL1 expression vectors. For the constructs 
with Gapdh 3' UTR alone or with AC repeats inserted, over- 
expression of CUGBP 1 or MBNL1 had no effect on luciferase 
activity (Fig. 4b). For the GT repeat construct, overexpression of 
CUGBP 1 decreased the luciferase activity, but MBNL1 had no 
effect. For the CTG repeat construct overexpression of MBNL1 
dramatically decreased the luciferase activity, and also overex- 
pression of CUGBP 1 significantly reduced luciferase activity 
(Fig. 4b). In order to shed light on the mechanism underlying the 
observed decrease in luciferase activity we investigated the decay 
of luciferase mRNA. The SV40 promoter of the luciferase reporter 
constructs was replaced with a tet-repressible promoter, and 
HEK293 Tet-off cells were transiently transfected with these 
constructs. Doxycycline was added to the medium to stop tran- 
scription of the tet-responsive promoter, and the temporal profiles 
of luciferase and GAPDH mRNA levels were measured. Over- 
expression of MBNL1 together with the CTG repeat reporter 



construct resulted in highly increased decay of luciferase mRNA 
and CUGBP 1 overexpression together with the GT repeat reporter 
construct also increased mRNA decay. Overexpression of either 
protein together with the Gapdh 3' UTR control construct did not 
alter mRNA decay (Fig. 4c). These data demonstrate that binding of 
CUGBP 1 and MBNL1 to the 3' UTR promotes mRNA decay. To 
examine whether CUGBP 1 and MBNL1 regulate decay of endo- 
genous mRNAs, we next analyzed mRNA stability in actinomycin 
D treated C2C12 cells by expression arrays following siRNA knock 
down of CUGBP1 or MBNL1 (GEO accession number, GSE27583). 
To identify genes with reliable half-life estimates, we restricted 
our analysis to 195 transcripts using three conditions: (i) half-life 
between 2.5-5 hrs; (ii) correlation coefficient of fitting to an 
exponential decay greater than 0.9; and (iii) RMA-normalized 
signal values more than 100 at all time points. The median half-life 
of all the transcripts matching these criteria in the control is 3.56 hrs, 
whereas those from CUGBP 1- and MBNL1 -knocked down cells 
are significantly prolonged to 3.91 hrs and 3.73 hrs, respectively 
(Fig. 5a). We chose four additional representative mRNAs with a 
cluster of either CUGBP1- or MBNLl-tags in the 3'UTR, and 
confirmed by real time PCR that knockdown of either CUGBP 1 or 
MBNL1 results in approximately two-fold increase in mRNA half- 
life of these target mRNAs (Fig. 5b). The half-lives of 100 out of 195 
transcripts are prolonged both by knockdown of CUGBP 1 and 
MBNL1, suggesting overlapping activity in the regulation of 
mRNA decay by CUGBP 1 and MBNL1. We next analyzed the 
relationship between change in mRNA half-life and coverage of 
HITS-CLIP tags in the 3' UTRs. We found that genes displaying 
prolongation of half-lives in response to CUGBP 1 knockdown 
harbors more CUGBP 1 -tags in their 3' UTRs, compared to those 
displaying shortening of half-lives (Fig. 5c). Similarly, genes that 
display prolongation of their half-lives in response to MBNL1 
knockdown have more MBNLl-tags in their 3' UTRs (Fig. 5c). 

Gene Ontology analysis of CUGBP 1/MBNL1 -bound 3' UTRs 
revealed that the terms 'cytoskeletal protein binding, 'transcription 
factor binding' and 'RNA binding' are significantly overrepresented 
for CUGBP 1- and MBNL1 -bound genes (Table 1). 
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Figure 4 | Decay of luciferase mRNA by overexpression of CUGBP1/MBNL1. (a) Schemes of luciferase reporter plasmids harboring Gapdh 3' UTR. 
Each construct was made carrying either SV40 or tet-responsive promoter, (b) Luciferase activity after overexpression of CUGBP 1/MBNL1. HEK293 cells 
were transfected with the indicated SV40-driven luciferase reporter constructs. Luciferase activity is normalized for the transfection efficiency using co- 
transfection of pRL/SV40. (c) Decay of luciferase mRNA after overexpression of CUGBP1/MBNL1. HEK293 Tet-off cells were transfected with the 
indicated tet-responsive promoter-driven luciferase reporter constructs. Doxycycline was added to the medium to stop transcription at time 0. Temporal 
profiles of luciferase mRNA decay were quantified by real time RT-PCR and are normalized for Gapdh mRNA levels. All experiments were triplicated, and 
the mean and s.d. are indicated (* p < 0.05; ** p < 0.01). 
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Figure 5 | Global analysis ofmRNA decay by expression array of C2C12 cells treated with CUGBP1/MBNL1 siRNA. (a) Half-lives of mRNAs in C2C12 
cells with the indicated siRNAs. Red lines represent means and 95% confidence intervals. ** p < 0.01 and *** p < 0.001. (b) Real-time RT-PCR analysis of 
the stability of four representative endogenous mRNAs, which were detected by expression arrays. CLIP-tag distributions are shown above each gene 
structure. C2C12 cells were treated with either control (ctl), CUGBP1 (Cug), or MBNL1 (Mb) siRNA. Actinomycin D was added to the medium to stop 
transcription at time 0. Temporal profiles of decay of the indicated genes were analyzed by real-time RT-PCR and are normalized for Gapdh mRNA levels. 
All experiments were triplicated, and the mean and s.d. are indicated (* p < 0.05 and ** p < 0.01). (c) Tag counts in the 3' UTR of each gene are plotted in 
two categories of prolonging (up) and shortening (down) ofhalf-lives after MBNL1 and CUGBP1 siRNAs. Red lines represent means and 95% confidence 
intervals. ** p < 0.01. Tag counts were normalized by the gene expression level at 0 h of cells treated with control siRNA. 



Table 1 | 


The five most frequent Gene Ontology terms of 


mRNAs that are bound by CUGBP1 and MBNL1 to the 


3' UTR 


CLIP data 


GO ID 


Term 


P Value 


CUGBP1 


GO:0008092 


cytoskeletal protein binding 


1.58E-06 




GO:0003723 


RNA binding 


1 .40E-04 




GO:0008134 


transcription factor binding 


9.65E-04 




GO:0051082 


unfolded protein binding 


0.003184 




GO:00 19904 


protein domain specific binding 


0.006603 


MBNL1 


GO:0008092 


cytoskeletal protein binding 


7.31E-20 




GO:0008134 


transcription factor binding 


2.20E-08 




GO:0003723 


RNA binding 


0.001893 




GO:0019899 


enzyme binding 


0.002046 




GO:0032553 


ribonucleotide binding 


0.004210 


We utilized the mRNAs that have more than 8-fold coverage of CLIP tags in their 3'UTR for the analysis by DAVID 53 ' 54 . 



SCIENTIFIC REPORTS | 2 : 209 | DOI: 1 0. 1 038/srep00209 



6 



www.nature.com/ scientificreports 




PITX2 is a homeobox transcription factor that regulates left-right 
asymmetric morphogenesis 37 ' 38 and it is also deeply implicated in 
myogenesis during mouse embryonic development 39 41 . We found 
that the decay of Pitx2 mRNA is prolonged by knocking down 
MBNLl, but not CUGBPl in undifferentiated C2C12 cells (Fig. 6b 
and c). This is consistent with the fact that Pitx2 harbors a much 
higher number of MBNL1-CLIP tags than that of CUGBP1-CLIP 
tags in the 3' UTR (Fig. 6a). We also observed that down regulation 
of both CUGBPl and MBNLl decreases the decay of Myodl and 
Mbnl2 mRNA, but not that of Gapdh mRNA (Supplementary 
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Figure 6 | MBNLl binds to 3' UTR oiPitx2 and facilitates decay of Pitx2 
mRNA. (a) Distributions of MBNLl CLIP -tags in the P;'»2-gene 
structure, and schemes of luciferase reporter plasmids harboring wild-type 
(Pi(x2-3TJTR) and mutated (mut-3UTR) 3' UTRs of Pitx2. Red bars 
indicate locations of native YGCY motifs and yellow bars indicate those of 
mutated YGCY motifs. Individual mutations disrupting YGCY motifs are 
shown at the bottom. We introduced U at the 2nd YGCY motif to prevent 
formation of AU and AC repeats, which constitute a potential binding site 
of Hu proteins and hnRNP L, respectively, (b and c) Decay of endogenous 
Pitx2 mRNA by siRNA-mediated knockdown of CUGBP 1 (b) and MBNLl 
(c). C2C12 cells were treated with either control (ctl), CUGBPl (Cug) or 
MBNLl (Mb) siRNA. Actinomycin D was added to the medium to stop 
transcription at time 0. Temporal profiles of decay of the indicated genes 
were analyzed by real-time RT-PCR and were normalized for Gapdh 
mRNA levels. All experiments were triplicated, and the mean and s.d. are 
indicated (* p < 0.05; ** p < 0.01; *** p < 0.001). (d) Decay of luciferase 
mRNA in the Pitx2-3UTR and mut-3UTR constructs after either control 
(ctl) or MBNLl (Mb) siRNA in TetON-C2C12 cells. Culture medium was 
replaced by medium without doxycycline to stop transcription at time 0. 
Temporal profiles of luciferase mRNA decay are quantified by real-time 
RT-PCR and are normalized for Gapdh mRNA levels. All experiments were 
triplicated, and the mean and s.d. are indicated (** p < 0.01). (e) 
Immunoblot of PITX2 in undifferentiated C2C12 cells at 48 hrs after 
transfection of MBNLl siRNA. 



Fig. S8). Similarly, down regulation of CUGBPl decreases the decay 
of other myogenic transcription factors such as Myog and Mef2a 
mRNAs, and also of Cugbp2 (Supplementary Fig. S9). Further- 
more, knockdown of CUGBPl and MBNLl prolongs decay of 
Mbnll and Cugbpl mRNAs, respectively, suggesting a mechanism 
for cross-regulation of expression of MBNLl, CUGBPl, and their 
family proteins (Supplementary Fig. S8). 

To analyze more directly the role of MBNL 1 binding to the 3 ' UTR 
in regulation of mRNA decay, we examined the mRNA stability of 
firefly luciferase fused with the 3 ' UTR of Pitx2 (Fig. 6a) . There are 1 1 
YGCY motifs in the 3' UTR of Pitx2, and 4 of the 11 motifs have 
MBNL1-CLIP tags. We introduced artificial mutations in these 4 
motifs to prevent binding of MBNLl (Fig. 6a). Consistent with the 
proposed role for MBNLl in mRNA decay, we observe that disrup- 
tion of the MBNLl -binding motifs in the Pitx2-3' UTR abolished 
responsiveness to MBNLl knockdown (Fig. 6d). Furthermore, 
immunoblots demonstrated that MBNLl -knockdown enhanced 
expression of endogenous PITX2 in C2C12 cells (Fig. 6e). These data 
suggest that MBNLl promotes decay of Pitx2 mRNA and thereby 
represses expression of the PITX2 protein. 

Taken together, all of our data are consistent with a model where 
CUGBPl and MBNLl facilitate mRNA decay through binding to the 
3' UTR of target genes. 

Discussion 

CUGBPl and MBNLl are developmental^ regulated RNA-binding 
proteins that are causally associated with myotonic dystrophy type 1. 
In this study, we show that both CUGBPl and MBNLl preferentially 
bind to 3' UTRs and destabilize the bound mRNAs. In particular, we 
show that CUGBPl and MBNLl destabilize myogenic differentiation 
factors and RNA-binding proteins. In addition, our results confirm 
and significantly expand the current knowledge of the splicing- 
regulatory effects of CUGBPl and MBNLl. Taken together, the data 
from the present study indicates that CUGBPl and MBNLl are 
closely related and cross regulate alternative splicing and mRNA 
decay. 

MBNLl binding to 3' UTRs has not been previously reported. We 
show for the first time that MBNLl binds to 3' UTRs and promotes 
mRNA decay in both artificial constructs and in endogenous genes. 
We also demonstrate by expression arrays that both CUGBPl and 
MBNLl facilitate mRNA decay by binding to 3' UTRs. The present 
study demonstrates global in vivo interactions between CUGBPl and 
3' UTRs and reveals that CUGBPl also preferentially binds to 3' 
UTR rather than exons/introns. We provide in vivo evidence that 
CUGBPl facilitates mRNA decay of a broad spectrum of genes in 
addition to the previously reported genes 25-27-42-44 . 

Interestingly, we find that MBNLl promotes decay of Cugbpl 
mRNA and that CUGBPl promotes decay of Mbnll mRNA, and 
that this is associated with corresponding changes at the protein level 
during differentiation of C2C12 cells (Supplementary Fig. S4b). This 
may suggest that expression of CUGBPl and MBNLl are mutually 
regulated in myogenic differentiation. Kuyumcu-Martinez and col- 
leagues report that expanded CUG repeats of DMPK through an 
unknown mechanism leads to phosphorylation and thereby to sta- 
bilization of CUGBPl in DM1 myoblasts 10 . Our studies additionally 
suggest that loss of MBNLl in DM1 could lead to decreased decay of 
CUGBPl mRNA and hence to further increase of CUGBPl activity. 
Although CUGBPl is not upregulated in adult MBNLl -knockout 
mice, this mechanism could lead to increased misregulation of splic- 
ing and decay of the mRNAs of target genes in embryonic develop- 
ment that culminates in the DM1 phenotype. 

We find that binding sites for CUGBPl and MBNLl are enriched 
around alternative cassette exons (Fig. 2a). The binding sites for 
CUGBPl are prominent in adjacent intronic regions flanking 
alternative exons. Our functional analysis reveals that binding of 
CUGBPl to the upstream intron facilitates exon skipping, whereas 
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binding to the downstream intron enhances exon inclusion (Fig. 2b). 
Interestingly, similar regulation of alternative splicing has been 
observed for NOVA, FOX2 and PTB 30,45,46 , indicating the presence 
of a common underlying mechanism shared by these proteins. 

In contrast to CUGBP1, MBNL1 tags are also enriched in coding 
exons. Until now, splicing ds-elements of MBNL1 have been mapped 
exclusively to introns, and no exonic ris-element has been reported to 
our knowledge 20 ' 2334 ' 47,48 . Although MBNL1 preferentially binds to 
exons, MBNL1 binding to introns is enriched at alternative rather 
than constitutive splice sites (Fig. 2a. This enrichment is diffusely 
distributed throughout regions harboring 500 nt upstream or down- 
stream of alternative exons, in contrast to the prominent intronic 
peaks observed for CUGBP1 tags. This could suggest that MBNL1 
needs to bind simultaneously to the target exon and adjacent introns 
to regulate splicing. Functional analysis of MBNL1 reveals that bind- 
ing of MBNL1 close to the 3' end of the downstream intron facilitates 
exon skipping, whereas no characteristic binding pattern is observed 
for exons included in response to MBNL1 (Fig. 2c). PTB has also 
been reported to regulate alternative splicing through binding close 
to the 3' end of the downstream intron 36 . In contrast to MBNL1, 
however, binding of PTB to this region promotes exon inclusion. We 
similarly find binding of PTB to this region in our HITS-CLIP data in 
MBNL1 -regulated exons (Supplementary Fig. S7d). Interestingly, 
the MBNL1 -binding motif is enriched in PTB-regulated exons 46 . 
MBNL1 may thus compete for binding with other splicing factors 
like PTB and regulate alternative splicing events. 

Post-transcriptional gene expression regulation is crucial to 
achieve precise developmental and tissue- specific control of cellular 
processes. Our studies reveal that CUGBP1 and MBNL1 preferen- 
tially bind to the 3' UTRs of mRNAs encoding RNA-binding pro- 
teins and transcription factors, which can regulate cell development. 
During development of murine skeletal muscles, the nuclear level of 
MBNL1 increases, while that of CUGBP1 decreases 9,12 . Genes with 
mRNAs that can be bound both by CUGBP1 and MBNL1 are likely 
to be down- regulated by CUGBP1 in undifferentiated cells. If these 
genes need to be tightly down-regulated also in differentiated cells, 
MBNL1 can substitute for CUGBP1 in order to achieve continued 
destabilization of the target mRNA. We conclude that finely-tuned 
expression of CUGBP1 and MBNL1 maybe important regulators of 
myogenic differentiation through precise regulation of both alterna- 
tive splicing and mRNA stability. 

Methods 

Antibodies. Antibodies to CUGBP1 (3B1), MHC (H300), myogenin (M225) and 
PTB (N20) were purchased from Santa Cruz Biotechnology. Anti-GAPDH pAb was 
purchased from Sigma. Anti-PITX2 pAb was purchased from Abeam. Anti-MBNLl 
rabbit serum (A2764) was a kind gift of Dr. Charles A. Thornton at University of 
Rochester. The specificity of antibodies against CUG BP1 and MBNL1 is supported 
by the data in previous reports 2,3 and also by our siRNA experiments 
(Supplementary Fig. SI). 

Cell culture. Detailed methods are included in the Supplementary Information. 

HITS-CLIP. C2C12 cells were UV-irradiated at 400 mj and CLIP was performed as 
previously described 49 . High -throughput 36-bp single-end and 40-bp single-end 
sequencing was performed using an Illumina Genome Analyzer II. All HITS-CLIP 
data were registered in ArrayExpress with an accession number E-MTAB-414 and in 
ENA with an accession number ERP000789. Detailed information is provided in the 
Supplementary Information. 

Bioinformatics analysis. Illumina reads were first prepared by removing the 4-bp tag 
and filtering sequences composed primarily of Illumina adapter. The resulting reads 
were mapped to the mouse genome (NCBI Build 37. l/mm9) with default parameters 
using the BWA 50 mapping software. To extract consensus motifs from the mapped 
reads, we considered only uniquely aligned reads and first removed duplicate reads to 
avoid potential PCR-mediated deviations in addition to bias from very highly 
expressed transcripts. We then extended the reads to 110 nt, the expected mean of the 
CLIP fragments and used the SeqMonk software {www.bioinformatics.bbsrc.ac.uk/ 
projects/seqmonk) to identify binding regions by using the program's built-in peak 
detection algorithm. Peaks were scored using both a reads per peak scoring scheme 
and a maximum depth scoring scheme {effectively the height of the peak) in order to 
filter out peaks. For the identification of CUGBP1- and MBNLl-binding regions, we 



used PTB as a negative control and removed peaks present in the PTB dataset as well. 
We then selected CUGBP1 peaks that were present in the two independent CUGBP1 
CLIP experiments and MBNL1 peaks that were similarly corroborated by the two 
MBNL1 experiments. PTB binding regions were identified by removing peaks that 
were present in either of the four CUGBP1 and MBNL1 experiments. Finally, we 
restricted the set of binding regions to only those spanning 70-150 bp since this was 
the fragment length used in the CLIP experiments. We analyzed each dataset using a 
motif analysis tool, MEME 31 , using a background Markov model based on the entire 
mouse genome. 

We analyzed the mapped Illumina reads and binding regions and mapped them to 
UCSC knownGene annotations 51 of the mouse genome (NCBI Build 37.1/mm9) by 
writing and running Perl and Excel VBA programs, as well as by running BEDTools 
utilities 52 . Normalized complexity maps of CUGBP1/MBNL1/PTB-RNA interactions 
were generated as previously described 30 . For the control, normalized complexity map 
was similarly generated by analyzing 100 sets of 15 to 50 constitutive exons that were 
randomly selected from 118,969 constitutive exons in mm9. To identify enriched 
Gene Ontology terms, we used the Database for Annotation, Visualization and 
Integrated Discovery (DAVID 6.7) 53 - 54 . 

Construction of plasmids. To construct luciferase reporter vectors with the 3' UTR 
of Gapdh and Pitx2, 3' UTRs of these genes were amplified by PCR. Amplified DNA 
was ligated into the Xbal and BamHl sites of the pGL3-promoter vector (Promega) to 
substitute for the 3' UTR of the firefly luciferase gene. DNA fragments harboring GT 
and CTG repeats were amplified by self-priming PCR using primers terminating in a 
Xbal site, and ligated into the Xbal site to make the pGL3P -Gapdh-3' UTR. 

To construct tet-responsive luciferase constructs, the tet-responsive promoter 
region was excised from pTRE-Tight vector (Clontech) with Xhol-Hindlll site and 
cloned into the Xhol-Hindlll site of the pGL3-promoter vector with the 3' UTR of 
Gapdh and Pitx2. To introduce mutations in 3' UTR of Pitx2 in the luciferase con- 
struct, we used the QuikChange site-directed mutagenesis kit (Stratagene). 

To construct expression vectors for MBNL1 and CUGBP1, the human MBNL1 
cDNA and human CUGBP1 cDNA (Open Biosystems) were subcloned into the 
mammalian bidirectional expression vector pBI-CMV2 (Clontech), which should 
constitutively expresses the insert and AcGFPl. 

RNA interference and transfection. The siRNA duplexes against CUGBP1 and 
MBNL1 were synthesized by Sigma. The sense sequences of the siRNAs were as 
follows: Cugbpl-1, 5'-GCUUUGGUUUUGUAAGUUA-3'; Cugbpl-2, 5'-GGCUU- 
AAAGUGCAGCUCAA-3'; Mbnll-1, 5' -CACUGGAAGUAUGUAGAGA-3 ' ; and 
Mbnll-2, 5'-GCACAAUGAUUGAUACCAA-3'. We purchased the AllStar 
Negative Control siRNA (1027281) from Qiagen. C2C12 cells were seeded on 24- well 
plates, and transfected with siRNA using Lipofectamine 2000 (Invitrogen) according 
to the manufacturer's instructions. Tet-off advanced HEK293 cells were seeded on 
96-well plates, and were transfected with luciferase reporter gene constructs using 
FuGENE 6 (Roche) according to the manufacturer's instructions. At 48 hrs after 
transfection, cells were either harvested for RNA extraction or processed for isolation 
of total proteins or nuclear extracts. 

RT-PCR for splicing analysis. Total RNA was extracted using Trizol (Invitrogen) 
according to the manufacturer's instructions. cDNA was synthesized using an oligo- 
dT primer and ReverTra Ace (Toyobo), and PCR amplifications were performed 
using GoTaq (Promega) for 30-35 cycles. Sequences of the primers used for PCR are 
listed in the Supplementary Table S3. The intensities of PCR-amplified spliced 
products were quantified with the ImageJ 1.42q software (NIH). We then calculated a 
percentage of exon inclusion (% inclusion) as the ratio of the intensity of the upper 
band divided by the sum of intensities of all the bands. 

Real-time RT-PCR for RNA stability analysis. Total RNA was extracted using 
RNeasy mini kit (Qiagen) or CellAmp Direct RNA Prep Kit (Takara) according to the 
manufacturer's instructions. cDNA was synthesized as described above and real-time 
PCR was performed using the Mx3005P QPCR System (Stratagene) and the SYBR 
Premix Ex Taq II (Takara). Sequences of the primers used for PCR are listed in 
Supplementary Table S4. 

Microarray analysis. Total RNA was extracted using the RNeasy mini kit according 
to the manufacturer's instructions. We synthesized and labeled cDNA fragments 
from 100 ng of total RNA using the GeneChip WT cDNA Synthesis Kit (Ambion). 
The labeled cDNAs were hybrized to the Affymetrix Mouse Exon 1.0 ST Arrays for 
splicing analysis or the Affymetrix Mouse Gene 1.0 ST Arrays for analyzing temporal 
profiles of expression of CUGBPl/MBNLl-targeted genes following the 
manufacturer's protocols. The robust multichip analysis (RMA) algorithm was used 
to normalize the array signals across chips with the Affymetrix Expression Console 
software 1.1.2. All microarray data were uploaded to the Gene Expression Omnibus 
database (accession numbers, GSE29990 for exon arrays and GSE27583 for 
expression arrays). 

Western blotting. For preparation of total cell lysates, cells were lysed in buffer A 
(10 mM HEPES pH 7.8, 10 mM KC1, 0.1 mM EDTA, 1 mM DTT, 2 ug/ml 
Aprotinin, 0.5 mM PMSF, 0.1% NP-40) and incubate on ice for 20 min. After 
sonication, samples were centrifuged (15,000 rpm, 5 min) and the supernatants were 
stored at — 80°C for further experiments. For preparation of nuclear cell lysates, cells 
were suspended in 400 ul of buffer A. Nuclei were pelleted, and the cytoplasmic 
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proteins were carefully removed. The nuclei were then resuspended in buffer C 
(50 raM HEPES pH 7.8, 420 mM KC1, 0.1 raM EDTA, 5 mM MgCl 2 , 2% Glycerol, 
1 mM DTT, 2 ug/ml Aprotinin, and 0.5 mM PMSF). After vortexing and stirring for 
20 min at 4 J C, the samples were centrifuged, and the supernatants were stored at 
— 80 C. Samples were analyzed on a 10% SDS polyacrylamide gel, and the proteins 
were transferred to Immobilon polyvinylidene difluoride membranes (Millipore). 
Membranes were blocked with 1% BSA in Tris-buffered saline containing 0.05% 
Tween20 (TBST) for 1 hr, incubated for 1 hr with primary antibodies in TBST, 
washed three times with TBST, and incubated for 1 hr with horseradish peroxidase- 
conjugated anti-mouse or -rabbit immunoglobulin (GE) diluted 1 : 5,000 in TBST. 
After three washes in TBST, the blot was developed with the enhanced 
chemiluminescence system (GE) according to the manufacturer's instructions. 

Luciferase assay. HEK293 cells seeded on a 96 well plate were transfected with 10 ng 
of pGL3P -Gapdh-3' UTR with or without GT and CTG repeats, 5 ng of pRL/SV40 
(Promega), and 40 ng of pBI-CMV2-based CUGBP1 or MBNL1 expression vector 
using FuGENE 6. At 48 hrs after the transfection, the luciferase activity was measured 
using the Dual- Luciferase Reporter Assay System (Promega) according to the 
manufacturer's instructions. 
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